Accelerating AdaBoost using UCB
نویسندگان
چکیده
This paper explores how multi-armed bandits (MABs) can be applied to accelerate AdaBoost. AdaBoost constructs a strong classifier in a stepwise fashion by adding simple base classifiers to a pool and using their weighted “vote” to determine the final classification. We model this stepwise base classifier selection as a sequential decision problem, and optimize it with MABs. Each arm represents a subset of the base classifier set. The MAB gradually learns the “utility” of the subsets, and selects one of the subsets in each iteration. ADABOOST then searches only this subset instead of optimizing the base classifier over the whole space. The reward is defined as a function of the accuracy of the base classifier. We investigate how the well-known UCB algorithm can be applied in the case of boosted stumps, trees, and products of base classifiers. The KDD Cup 2009 was a large-scale learning task with a limited training time, thus this challenge offered us a good opportunity to test the utility of our approach. During the challenge our best results came in the Up-selling task where our model was within 1% of the best AUC rate. After more thorough post-challenge validation the algorithm performed as well as the best challenge submission on the small data set in two of the three tasks.
منابع مشابه
Acceleration of the gastrointestinal transit by polyethylene glycol effectively treats unconjugated hyperbilirubinaemia in Gunn rats.
BACKGROUND AND AIMS Several conditions that delay gastrointestinal transit are associated with unconjugated hyperbilirubinaemia. We hypothesised that the gastrointestinal transit time is directly related to plasma unconjugated bilirubin (UCB) concentrations, and that this relationship can be used to develop a new therapeutic strategy for severe unconjugated hyperbilirubinaemia in the Gunn rat m...
متن کاملInference of the Trend in a Partially Linear Model
In this paper, we construct the uniform confidence band (UCB) of a time-varying trend in a partially linear model. A two-stage local linear regression is proposed to estimate the time-varying trend. Based on this estimate, we develop an invariance principle to construct the UCB of the trend function. The proposed methodology is used to estimate the Non-Accelerating Inflation Rate of Unemploymen...
متن کاملUmbilical cord blood transplantation for the treatment of hematologic malignancies.
BACKGROUND The use of unrelated umbilical cord blood (UCB) has grown as an allogeneic source of hematopoietic cells for transplantation of patients with hematologic malignancies. As the number of UCB transplantation procedures has grown, an increasing number of publications have focused on disease-specific outcomes. METHODS This review focuses on the outcome data following UCB transplantation...
متن کاملGAdaBoost: Accelerating Adaboost Feature Selection with Genetic Algorithms
Boosted cascade of simple features, by Viola and Jones, is one of the most famous object detection frameworks. However, it suffers from a lengthy training process. This is due to the vast features space and the exhaustive search nature of Adaboost. In this paper we propose GAdaboost: a Genetic Algorithm to accelerate the training procedure through natural feature selection. Specifically, we pro...
متن کاملBoosting Applied to Word Sense Disambiguation
In this paper Schapire and Singer s AdaBoost MH boosting algorithm is applied to the Word Sense Disambiguation WSD problem Initial experiments on a set of selected polysemous words show that the boosting approach surpasses Naive Bayes and Exemplar based ap proaches which represent state of the art accuracy on supervised WSD In order to make boosting practical for a real learning domain of thou ...
متن کامل